† Corresponding author. E-mail:
Project supported by the National Natural Science Foundation of China (Grant Nos. 61773399, 61673389, and 61273202) and the Special Funded Project of China Postdoctoral Science Foundation (Grant No. 2017T100792).
To explore the influence of quantum information on the common social problem of honesty and trickery, we propose a Bayesian model for the quantum prisoners’ dilemma game. In this model, the players’ strategy formation is regarded as a negotiation of their move contract based on their types of decision policies, honesty or trickery. Although the implementation of quantum information cannot eliminate tricky players, players in our model can always end up with higher payoffs than in the classical game. For a good proportion of a credibility parameter value, a rational player will take an honest action, which is in remarkable contrast to the observation that players tend to defect in the classical prisoners’ dilemma game. This research suggests that honesty will be promoted to enhance cooperation with the assistance of quantum information resources.
Cooperation among players is one of the most concerned problems in game theory.[1,2] A variety of concrete game models examine whether a Nash equilibrium solution is Pareto optimal,[3–6] and the critical research question is whether a mutually beneficial cooperation can be spontaneously formed and maintained among players. In classical non-cooperative games, both strategies and moves are decided individually and players’ rational strategies often end up with payoffs lower than what they can get through cooperation. In contrast to the aforementioned observation, strategies and moves could be correlated in quantum games even if players conduct no classical communications and make only local operations. The entanglement and interference of quantum information makes players’ decision-making a quantum process.[7,8] With the help of quantum correlation, players in quantum games can negotiate their moves for cooperation by using quantum information. This property makes it possible to make cooperative moves in a non-cooperative quantum game, thereby facilitating players to solve their classical dilemmas.[9–12]
From the viewpoint of players’ cooperation, we can generally divide a quantum game into two stages: strategy negotiation and move decision. At the stage of strategy negotiation, players conduct strategy operations on the local part of entangled qubits and develop together a quantum state as their move contract. At the stage of move decision, the state of such quantum contract are measured in certain bases to define players’ moves according to their decision policies. Previous studies on quantum games generally focus on one of these two stages. For instance, the Eisert–Lewenstein–Wilkens quantum game scheme[9,13–16] concentrates on the quantization of strategy negotiation process, where strategy operations are expanded from classical bit operators to unitary quantum gates, and the measurement basis for move decision is assumed to be constant. In contrast, players adjust the measurement basis in Marinatto’s scheme[17–21] and Bayesian quantum games[22–28] to optimize their move decision policies. Entanglement improves players’ payoffs and quantum uncertainty keeps the fairness of the game.[29] On the other hand, the stage of strategy negotiation is omitted and players’ move contract is restricted to be some given quantum states.
Based on our generalized two-stage quantum game model, we conduct an investigate into the influence of the tricky action on cooperation in this article. In Eisert’s model, a quantum cooperation contract is achieved by players with the assumption that they all honestly take the established decision policy. In Marinatto’s model and Bayesian games, players are considered to follow the given move contract in determining their moves. It appears that many new issues may arise in quantum games where both strategies and decision policies can simultaneously be determined by players. Here we follow Eisert’s model and concentrate on a new quantum prisoners’ dilemma game, where players are out of supervision. In this game, players may not faithfully follow the established decision policies, but take the tricky action of unilateral changing decision policies. It can be expected that such tricky action may lead to a player’s inconsistent moves and, hence, the collapse of existing Nash equilibria. From this perspective, we will carefully examine the Nash equilibria and corresponding strategy for cooperative move contract in quantum games with tricky players.
In the following, we will view our quantum prisoners’ dilemma model as a Bayesian game in order to clarify the difference between player’s strategy operations and the modification of decision policies. We consider that players with different decision policies are of different types, honest and tricky. A player has only incomplete information about the types of his/her opponents. Following previous studies on quantum prisoners’ dilemma, we take the standard orthonormal basis as players’ default move decision policy, and regard the change of decision policy as tricky action. By introducing and employing a parameter of credibility to depict the probability that the players are honest or tricky, we study the influence of the tricky action on the Nash equilibria and the cooperation between players, and derive mixed strategies for Nash equilibria under typical quantum strategy operations. In addition, we discuss the positive effect of quantum entanglement and coherence on classical social cooperation.
The rest of our manuscript is organized as follows. In Section
The process of a typical two-player Alice and Bob 2 × 2 quantum game[9,17,23] can be characterized by Fig.
The implementation of quantum information can help players improve their payoffs in games. As a famous example, the prisoners’ dilemma is resolved when quantum strategies are available.[9] Table
![]() | Table 1. Payoff matrix of the classical prisoners’ dilemma game. C (cooperation) and D (defect) are moves as that players Alice and Bob can make in the game.[9] . |
In previous studies, quantum contracts are established on the assumption that both players take the default decision policy VA = VB = I. If the decision policy can be modified, players may break their cooperative contract again. As a typical case, we consider that player i can deceive his/her opponent by changing the decision policy from Vi = I to Vi = σx, and show the corresponding moves and payoffs in Table
![]() | Table 2. Moves and corresponding payoffs of different types of players {(mA,mB),($A,$B)}. Players’ moves are decided under move contract |ψ〉S = |0〉A ⊗ |0〉B. The decision policy of player is either PH = I (honest) or PT = σx (tricky). . |
To address the aforementioned problems, we expand the quantum prisoners’ dilemma game[9] to a Bayesian version. In this model, the players should determine both their strategies and decision policies. Our investigation focuses on the non-entanglement initial state |ψ〉AB = |0〉A ⊗ |0〉B and the the maximally entangled operation J and J†. We consider that only three typical strategies, SC = I, SD = iσy and SQ = iσz, and two decision policies, PH = I and PT = σx, are available to players.
Table
![]() | Table 3. Payoff matrix of the maximally entangled quantum Bayesian prisoners’ dilemma game. The quantum strategy operators available to players are SC = I, SD = iσy and SQ = iσz. In this game, players are either honest or tricky. PH and PT correspond to player honestly keeping the default decision policy or taking the tricky action, respectively. . |
In this section, we derive the Nash equilibrium of the quantum Bayesian prisoners’ dilemma game. The quantum cooperation contract and corresponding payoffs are analyzed under different social credibility p. According to Table
In the following discussion, we denote the probabilities of player i taking operators Sm by





We denote the Nash equilibrium mixed strategy of player i = A, B as





Players i = A, B can arrive at the Nash equilibrium according to Scheme 2a when creditability p ∈ [ 0, 1/4 ), With this scheme, an honest player will end up with an expected payoff
When p ∈ [5/7,1], an honest player can get an expected payoff
In the aforementioned discussions, we have obtained mixed quantum strategies for Nash equilibrium under different creditability p. It should be noted that there are more than one set of Nash equilibrium strategy in some regions of p. In these cases, rational players would prefer the strategies that maximizes the sum of the expected payoff of both honest and tricky players,
![]() | Table 4. The Nash equilibrium mixed strategies and expected payoffs for creditability p ∈ [0,1]. The payoff of an honest player is always higher than a tricky player for p ∈ [0,1/4). With p ∈ [1/4,5/7), honest and tricky players will get the same expected payoff. Only when creditability p ∈ [5/7,1] will the payoff of an honest player be exceeded by a tricky player. When p ≤ 5/7, a rational player would not benefit from taking a tricky action. . |
As mentioned in Section
When p ∈ [0,1/4), an honest player will always conduct quantum operation SQ. Thus the quantum prisoners’ dilemma game between two honest players will end up with the Pareto optimal Nash equilibria, where payoff
It should be noted that we have made full use of quantum entanglement in this quantum game. The strategy operations implemented in forming move contracts includes both the classical strategy SC = I, SD = σx and the optimal quantum strategy of the quantum prisoners’ dilemma game[9] SQ = iσz. Although there are still other available operations in a quantum game, the most typical quantum strategy operations are discussed in our study. In addition, these operations are conducted on a maximally entangled quantum state. Hence our cooperation contract ρT is a coherent superposition of different pure strategies, rather than a simple tensor product of each players’ strategies. It is the entanglement and coherence in players’ strategy that makes quantum contract different from the classical one and provides our game with advantage over the classical version.
In summary, a new quantum Bayesian prisoners’ dilemma game model is proposed to investigate the influence of tricky actions on quantum games. In this model, players would not only decide their quantum strategies, but also adjust their decision policies for actual moves. We treat the unilateral change of the default decision policy as a player’s tricky action. It is assumed that a player has no certain information about whether his/her opponent is honest or tricky. Instead, the players share common knowledge of the probability of a player being honest, characterized by a parameter of credibility p. A thorough study is carried out to examine the equilibrium of this incomplete information game under typical quantum strategy operations. Although quantum games can not eliminate tricky actions, the two players can always achieve higher payoffs than those in the classical game. For a good proportion of the creditability parameter p, a rational player will take an honest action. This is in contrast to the observation that the players always defect in the classical game. This research suggests that honesty will be promoted to enhance cooperation in social affairs with the assistance of quantum information entanglement and coherence.